2 Classical Field Theory 



In what follows we will consider rather general field theories. The only guiding 
principles that we will use in constructing these theories are (a) symmetries and 
(b) a generalized Least Action Principle. 



2.1 Relativistic Invariance 

Before we saw three examples of relativistic wave equations. They are Maxwell's 
equations for classical electromagnetism, the Klein-Gordon and Dirac equations. 
Maxwell's equations govern the dynamics of a vector field, the vector potentials 
A^(x) = (A , A), whereas the Klein-Gordon equation describes excitations of 
a scalar field 4>{x) and the Dirac equation governs the behavior of the four- 
component spinor field ip a (x)(a = 0, 1, 2, 3). Each one of these fields transforms 
in a very definite way under the group of Lorentz transformations, the Lorentz 
group. The Lorentz group is defined as a group of linear transformations A of 
Minkowski space-time M onto itself A : M. — > M. such that 



^ = Ktx v 



(1) 



The space-time components of A are the Lorentz boosts which relate inertial 
reference frames moving at relative velocity v. Thus, Lorentz boosts along the 
x 1 -axis have the familiar form 



x ' = 



X — 



x° + vx 1 /c 
^l-v 2 /c 2 
x 1 + vx°/c 



X = X 

x 3 ' = 



^l-v 2 /c 2 

2 



(2) 



where x° — ct, x 1 — x, x 2 — y and x 3 = z (note: these are components, not 
powers!). If we use the notation 7 = (1 — u 2 /c 2 ) -1 / 2 = cosha, we can write the 
Lorentz boost as a matrix: 



/x°'\ 



V 3 '/ 



/cosha sinha 0\ I x°\ 

sinha cosha x 1 

1 x 2 

1/ \x 3 / 



(3) 



The space components of A are conventional three-dimensional rotations R. 

Infinitesimal Lorentz transformations are generated by the hermitian oper- 
ators 

Lf, u = i(x M <9„ - x v d^) (4) 
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where = They satisfy the algebra 

{L^v, Lpa] = iffvpLfia — ig^pLuu — ig^cL^p + igp a L U p (5) 

where g^ v is the metric tensor for Minkowski space-time (see below). This is 
the algebra of the group SO (3, 1). Actually any operator of the form 

M„ v = + (6) 

where are 4x4 matrices satisfying the algebra of Eq. 5 satisfies the same 
algebra. Below we will discuss explicit examples. 

Lorentz transformations form a group, since (a) the product of two Lorentz 
transformations is a Lorentz transformation, (b) there exists an identity trans- 
formation, and (c) Lorentz transformations are invertible. Notice, however, 
that in general two transformations do not commute with each other. Hence, 
the lorentz group is non-Abelian. 

The Lorentz group has the defining property that it leaves invariant the 
rclativistic interval 

2 2 ->2 2/2 ->2 ln\ 

X = X Q - X = C t — X (7) 

The group of Euclidean rotations leave invariant the Euclidean distance x 2 and 
it is a subgroup of the Lorentz group. The rotation group is denoted by SO(3), 
and the Lorentz group is denoted by SO(3, 1). This notation makes manifest 
the fact that the signature of the metric has one + sign and three — signs. 
We will adopt the following conventions and definitions: 

1. Metric Tensor. We will use the Bjorkcn and Drell metric in which the 
metric tensor g^ u is 



= <r = 



( 1 \ 

-1 

0-10 

\ -1 / 



(8) 



With this notation the infinitesimal rclativistic interval is 

ds 2 = dx^dx^ = g^dx^dx" = dx 2 - dx 2 = c 2 dt 2 - dx 2 (9) 

2. ^-vectors: 

(a) x^ is a contravariant 4-vector, x^ — (ct, x) 

(b) x^ is a covariant 4-vector x^ = (ct, —x) 

(c) Covariant and contravariant vectors (and tensors) are related through 
the metric tensor g^ v 

A» = g^A v (10) 

(d) x is a vector in R 3 

(c) p^ = (—,p)- Hence, p^ — % — p 2 is a Lorentz scalar. 
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3. Scalar Product: 

p-q = p t _ l cf =Poqo-p-q = Pi J ,q v g flv (n) 

4. Gradients: = an d <9 M = We define the D'Alambertian Q 

□ ^^9 2 ^ld t 2 -v 2 (12) 

From now on we will use units of time [T] and length [L] such that h = 
c = 1. Thus, [T] = [L] and we will use units like centimeters (or any other 
unit of length) . 

5. Interval: The interval in Minkowski space is x 2 , 

x 2 = x^ = x 2 -x 2 (13) 

Time-like intervals have x 2 > while space-like intervals have x 2 < 0. 

Since a field is a function (or mapping) of Minkowski space onto some other 
(properly chosen) space, it is natural to require that the fields should have simple 
transformation properties under Lorentz transformations. For example, the 
vector potential A 1 * (x) transforms like J^-vector under Lorentz transformations, 
i.e., if x' 11 = h^x v => A ,tJ -(x') = A^A v (x). In other words, A^ transforms like 
x* 1 . Thus, it is a vector. All vector fields have that property. A scalar field 
$(x), on the other hand, remains invariant under Lorentz transformations, 

$V) = ( 14 ) 

A 4-spinor ip a (x) transforms under Lorentz transformations. Namely, there 
exists an induced 4x4 transformation matrix 5(A) such that 

S^A -1 ) = S _1 (A) (15) 

and 

*'(Aar) = S(A)*(x) (16) 
Below we will give an explicit expression for S(A). 

2.2 The Lagrangian, the Action and and the Least Action 
Principle 

The evolution of any dynamical system is determined by its Lagrangian. In the 
Classical Mechanics of systems of particles described by the generalized coordi- 
nate q, the Lagrangian L is a differentiable function of q and its time derivatives. 
L must be differentiable since otherwise the equations of motion would not local 
in time, i.e. could not be written in terms of differential equations. An argu- 
ment a-la Landau-Lifshitz enables us to "derive" the Lagrangian. For example, 
for a particle in free space, the homogeneity, uniformity and isotropy of space 
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and time require that L be only a function of the absolute value of the velocity 
|t7j. Since \v\ is not a diffcrcntiable function of v, the Lagrangian must be a 
function of v 2 . Thus, L = L(v 2 ). In principle there is no reason to assume 
that L cannot be a function of the acceleration a (or rather a ) or of its higher 
derivatives. Experiment tells us that in Classical Mechanics it is sufficient to 
specify the initial position x(0) of a particle and its initial velocity v(0) in order 
to determine the time evolution of the system. Thus we have to choose 



L(v 2 ) = const + ^mv 2 



(17) 



The additive constant is irrelevant in classical physics. Naturally, the coefficient 
of v 2 is just one- half of the inertial mass. However, in Special Relativity, the 
natural invariant quantity to consider is not the Lagrangian but the action S. 
For a free particle the relativistic invariant (i.e., Lorentz invariant ) action 

must involve the invariant interval or the invariant length ds = cyj 1 — dt, 
the proper time. Hence one writes 



S = mc / ds 

J Si 

Thus the relativistic Lagrangian is 



(18) 



L = mc \ 1 — 



(19) 



and, as a power series expansion, it contains all powers of v 2 /c 2 . 

Once the Lagrangian is found, the classical equations of motion are deter- 
mined by the Least Action Principle. Thus, we construct the action S 



S 



dt L q, 



dq 
dt 



(20) 



and demand that the physical trajectories q(t) leave the action S stationary, i.e., 
SS — 0. The variation of S is 



SS 



dL dL dq 



(21) 



Integrating by parts, we get 



SS 



= f 



u dt 



dt Sq 



ftf 

J dt Sq 


dL 
dq 


'dL d 


( dL 


~dq~~ It 


\Wt 



dL 



dt 



(22) 
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If we assume that the variation 5q is an arbitrary function of time that vanishes 
at the initial and final times U and tf(8q(ti) = Sq(tj) — 0), we find that 5S = 
if and only if the integrand vanishes identically. Thus, 

These are the equations of motion or Newton's equations. In general the equa- 
tions that determine the trajectories which leave the action stationary are called 
the Euler- Lagrange equations. 



2.3 Scalar Field Theory 

For the case of a field theory, we can proceed very much in the same way. Let 
us consider first the case of a scalar field <fr(x). The action S must be invariant 
under Lorentz transformations. Since we want to construct local theories it is 
natural to assume that S is determined by a Lagrangian density C 

S = j d*xC (24) 

Since the volume element of Minkowski space d^x is invariant, L can only be a 
local, differentiable function of Lorentz invariants that can be constructed out 
of the field ${x). Simple invariants are $(x) itself and all of its powers. The 
gradient <9 M <I> = J^- is not an invariant but the D'alambertian d 2 <& is. d^Qd^Q 
is also an invariant under a change of the sign of <j>. So, we can write the 
following simple expression for C: 

C = hd^d^ - V(<5>) (25) 

where V($) is some potential (a polynomial function of <&). Let us consider the 
simple choice 

= im 2 $ 2 (26) 

where m = mc/h. Thus, 

C = - im 2 $ 2 (27) 

This is the Lagrangian density for a free scalar field. We will discuss later on in 
what sense this field is "free". Notice, in passing, that we could have added a 
term like d 2 <&. However this term, in addition of being odd under <f> — > — $, is 
a total divergence and, as such, it has an effect only on the boundary conditions 
but it does not affect the equations of motion. In what follows will will not 
consider surface terms. 
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The Least Action Principle requires that S be stationary under arbitrary 
variations of the field $ and of its derivatives Thus, we get 



SS 



d 4 a 



SC 
6$ 



<5$ 



5d u <5> 



(28) 



Notice that since £ is a functional of $, we have to use functional derivatives, 
i.e., partial derivatives at each point of space-time. Upon integrating by parts, 
we get 



SS 



d 4 x d,, 



SC 
Sd^ 



5$ 



d 4 x 5$ 



SC 



SC 

sh\& 



(29) 



Instead of considering initial and final conditions, we now have to imagine that 
the field $ is contained inside some very large box of space-time. The term 
with the total divergence yields a surface contribution. We will consider field 
configurations such that (5$ = on that surface. Thus, the Euler-Lagrange 
equations are 



More explicitly, we find 



and 



dC 



SC 



sc 

sb~® 



dV 
9$ 







SC 

»sb\M 



= d u d^ = <9 2 $ 



By direct substitution we get the equation of motion (or Geld equation) 

dV 



9^ + ^ = 



d<5> 



For the choice 



the field equation is 



dV 



= m 2 $ 



(□ + m 2 )$ = 



(30) 

(31) 
(32) 

(33) 

(34) 
(35) 



where | | = d 2 



1 d 2 



V 2 - Thus, we find that the equation of motion for the 



free massive scalar field $ is 

1 <9 2 $ 



c 2 dt 2 



- v $ + m 2 $ = 



(36) 



This is precisely the Klein-Gordon equation if the constant m is identified with 
^ . Indeed, the plane- wave solutions of these equations are 



(37) 



G 



where po and p are related through the dispersion law 



pi = p A c z + m 2 c 4 (38) 

which means that, for each momentum p, there are two solutions, one with 
positive frequency and one with negative frequency. We will see below that, in 
the quantized theory, the energy of the excitation is indeed equal to |po I ■ Notice 
that i- = ^ has units of length and it is equal to the Compton wavelength 
for a particle of mass to. From now on (unless it stated the contrary) I will use 
units in which ft = c = 1 in which m — fh. 

The Hamiltonian for a classical field is found by a straightforward gener- 
alization of the Hamiltonian of a classical particle. Namely, one defines the 
canonical momentum ±l(x), conjugate to the field (the "coordinate ") $(x), 

n(s) = (39) 

where 

*(*) = £(«) (40) 

In Classical Mechanics the Hamiltonian H and the Lagrangian L are related by 

H =pq-L (41) 

where q is the coordinate and p the canonical momentum conjugate to q. Thus, 
for a scalar held theory the Hamiltonian density H is 

h = n{x)i(x) - c 

= \n 2 {x)+ l -(y<$>{x)) 2 +V{${x)) 

(42) 

For a free massive scalar field the Hamiltonian is 

1 1 / — * \ ^ ill ^ 

H = -I?{x) + -[ V <S>{x)) +— $ 2 (x)>0 (43) 

which is always a positive definite quantity. Thus, the energy of a plane wave 
solution of a massive scalar field theory (i.e., a solution of the Klein-Gordon 
equation) is always positive, no matter the sign of the frequency. In fact, the 
lowest energy state is simply $ = constant. A solution made of linear superpo- 
sitions of plane waves {i.e., a wave packet) has positive energy. Therefore, in 
Field Theory, the energy is always positive. We will see that, in the quantized 
theory, the negative frequency solutions are identified with antiparticle states 
and their existence do not signal a possible instability of the theory. 
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2.4 Classical Field Theory in the Canonical (Hamiltonian) 
Formalism 

In Classical Mechanics it is often convenient to use the canonical formulation 
in terms of a Hamiltonian instead of the Lagrangian approach. For the case 
of a system of particles, the canonical formalism proceeds as follows. Given a 
Lagrangian L(q, q), a canonical momentum p is defined to be 



dL _ 
dq ^ 



(44) 



The classical Hamiltonian H(p, q) is defined by the Lcgcndrc transformation 

H(p,q) =pq- L(q,q) (45) 
If the Lagrangian L is quadratic in the velocities q and separable 



L = _ m( f _ V{q) 



then, H(pq) is simply given by 



H(p, q)=pq-( 



mq 



V(q)) = ^ + V(q) 



where 



dL 



p= — = mq 
oq 



H is identified with the total energy of the system. 

In this language, the Least Action Principle becomes 



SS = S J L dt = S J [pq- H{p, q)\ dt = 



Hence 

Upon an integration by parts we get 
dt 

which can only be satisfied if 



... .. . dH . OH 

dt dp q + p oq — op — oq — — 



dp 



dq 



= o 



q = 



dH 

dp 



P = - 



dH 
dq~ 



(46) 

(47) 
(48) 

(49) 
(50) 

(51) 
(52) 



These are Hamilton's equations. 

Let us introduce the Poisson Bracket {A, B} qp of two functions A and B of 
q and p by 

r , dAdB dAdB 

< A ' B >" s W7*-7*7fc (53) 
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Let F(q,p,t) be some differentiable function of q,p and t. Then the total time 
variation of F is 

dF _ OF dFdq dF dp 

~dt ~ ~dt + ~dq~di + ~dpdt ^ ' 

Using Hamilton's Equations we get the result 

dF _dF dFdH dF dH 
dt dt dq dp dp dq 

or, in terms of Poisson Brackets, 

dF dF 



dt dt 

In particular, 



+ {F,H,} gp (56) 



dq dH _ dq dH dq dH 
dt dp dq dp dp dq 

since 



U,H} qP (57) 



| = an, ,S, 

Also the total rate of change of the canonical momentum p is 

dp _ dpdH_ ^ dpdH_ = _dH_ 
dt dq dp dp dq dq 



since |£ = and |£ = 1. Thus, 



dp 



-± = {p,H} qp (60) 



Notice that, for an isolated system, H is time- independent. So, 

dH , N 

and 

dH dH r 

*- = -5r + ^- = ° ^ 

since 

{ir,fT} w = (63) 

Therefore, if can be regarded as the generator of infinitesimal time translations. 
In passing, let us also notice that the above definition of the Poisson Bracket 
implies that q and p satisfy 

{q,p} qp = i (64) 

This relation is fundamental for the quantization of these systems. 

Much of this formulation can be generalized to the case of fields. Let us first 
discuss the canonical formalism for the case of a scalar field $ with Lagrangian 
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density £($, d^, $). We will choose <E>(x) to be the (infinite) set of canonical 
coordinates. The canonical momentum H(x) is defined by 

II(x) = .- g f, r (65) 

If the Lagrangian is quadratic in the canonical momentum n(x) is simply 
given by 

II(x) = <9 $(x) = $(x) (66) 
The Hamiltonian density H(&, II) is a local function of $(x) and n(x) given by 

w($,n) = n(x)a $(x)-£($,9 $) (67) 

If the Lagrangian density C has the simple form 



C=-{d ll *) 2 -V{*) (68) 



then, the Hamiltonian density W($,II) is 

W = n$ - £($, $, Vi*) = V(x) + ^(v*(*)) 2 + V ($(x)) (69) 

The canonical field $(x) and the canonical momentum II(x) satisfy the equal- 
time Poisson Bracket (PB) relations 

{$(x,x ),n(y,x )}p B = S(x-y) (70) 

where 5(x) is the Dirac <5-function and {A,B}pb is now 

5A SB SA SB 



{A,B} PB = Jd 3 x 



<5<&(x, xo) SIL(x, xo) SIL(x, xo) £$(x, xo) 



(71) 



for any two functional A and B of $(x) and n(x). This approach can be 
extended to theories other than that of a scalar field without too much difficulty. 
We will come back to these issues when we consider the problem of quantization. 

2.5 Field Theory of the Dirac Equation 

We now turn to the problem of a field theory for spinors. Let us rewrite the 
Dirac equation 

Q\p fa 

ih— = — a ■ V* + Pmc 2 tf = H DirEiC ^ (72) 
at i 

in a manner in which relativistic covariance is apparent. The operator £?Dirac 
defines the Dirac Hamiltonian. 
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We first recall that the 4x4 hermitean matrices a and (3 should satisfy the 
algebra 

{oti,aj} =25ijl 
{a u f3} =0 
a}=f = 1 

(73) 

A simple realization of this algebra is given by the 2x2 block (Dirac) matrices 

a* J /?= (o -J J (?4) 

where the a"s are the Pauli matrices 

and / is the 2x2 identity matrix. This is the Dirac representation of the 
Dirac algebra. It is now convenient to introduce the Dirac 7-matrices which are 
defined by the following relations: 

7 = p 7 4 = Pa 1 (76) 

Thus, the matrices 7 M are 

-/) ' = 0) ("> 

and obey the algebra 

{7^,7"} = 2^1 (78) 
In terms of the 7-matrices, the Dirac equation takes the much simpler form 

lyfd*-™ )* = (79) 

where ^! is a 4-spinor. It is also customary to introduce the notation (Feynman's 
slash) 

4 = «m7 m (80) 
Using Feynman's slash, we can write the Dirac equation in the form 

(0-™)* = O (81) 

From now on I will use units in which h = c = 1. Thus energy is measured in 
units of (length) _1 and time in units of length. 

Notice that, if ^ satisfies the Dirac equation, then 

(i0 + m){i0 - m)V = (82) 



11 



Also, 



(83) 

where I used the fact that the commutator [7^, 7"] is antisymmetric with indices 
/j, and v. Thus, satisfies the Klein-Gordon equation 

(□+m 2 )* = (84) 

withQ= d 2 . 

2.5.1 Solutions of the Dirac Equation 

Let us briefly discuss the properties of the solutions of the Dirac equation. Let us 
first consider solutions representing particles at rest. Thus ^ must be constant 
in space and all its space derivatives must vanish. The Dirac equation becomes 

= (85) 
where t = xq (c = 1). Let us introduce the bispinors cf> and \ 



*=UJ ,S6, 
We find that the Dirac equation reduces to a simple system of two 2x2 equations 

.d<i> 

% - = 

1- = -m X 



The solutions are 



and 



^ 



(87) 



e- imt ( x I (88) 



Xi = e" nt ( J ) X2 = e 4mt ( J ) ( 89 ) 

Thus, the upper component <f) represents the solutions with positive energy 
while x represents the solutions with negative energy. The additional two-fold 
degeneracy of the solutions is connected to the spin of the particle. 
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More generally, in terms of the bispinors 4> and X the Dirac Equation takes 
the form, 

9<P , 1^ - 
I— = m<p + -a ■ VX (90) 



dt 



i 



<9y 1 -> 

i-£ = -m X +^-V<P (91) 

In the limit c — > oo, it reduces to the Schrodingcr-Pauli equation. The slowly 
varying amplitudes <f> and x, defined by 

X = ^ mt X (92) 

with x small and nearly static, define positive energy solutions with energies 
close to +m. In terms of <j> and x, the Dirac equation becomes 

»i = r • ^ (93) 

<9y 1 
i-£ = -2m X + -3 ■ (94) 

Indeed, in this limit, the 1. h. s. of Eq. 94 is much smaller than its r. h. s. . Thus 
we can approximate 

2mx « -<? ■ \/<f> (95) 
i 

We can now eliminate the "small component" x from Eq. 93 to find that <j> 
satisfies 

which is indeed the Schrodinger-Pauli equation. 

Conserved Current: Finally, let us introduce one last bit of notation. We 
define ^ by 

# = & 7° (97) 
in terms of which we can write down the 4-vector 

f = (98) 

which is conserved, i.e., d^j^ = 0. Notice that the time component of j M is the 
density 

j° = vj/ 7 °* = *t^r (99) 
and that the space components of j M are 

j = vj/^ = <ft 7 ° ^ = ( 10 q) 

which we can regard as a (conserved) probability current. 
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2.5.2 Relativistic Covariance 



Let A be a Lorentz transformation, ty(x) the spinor field in an inertial frame and 
\&'(a:') be the Dirac spinor field in the transformed frame. The Dirac equation 
is covariant if the Lorentz transformation 

< = K x « ( 101 ) 
induces a transformation 5(A) in spinor space 

= S(A) a ^ p (x) (102) 

such that 

i-f-^H ~ m ) ^ */>(*) = =* (<V^r - m) = (103) 

Notice two important facts: (1) both the field 'J and the coordinate x change 
under the action of the Lorentz transformation, and (2) the 7-matrices and the 
mass m do not change under a Lorentz transformation. Thus, the 7-matrices 
are independent of the choice of a reference frame. However, they do depend 
on the choice of the set of basis states in spinor space. 

What properties should the representation matrices 5(A) have? Let us first 
observe that if x' = Ax, then 

d dx v d _ / .-n" 9 
dx'^ ~ dx'i 1 dx v ~ ^ dx v 

for 

x lfl = A£ar" (105) 

Thus, qIjt is a covariant vector. By substituting this transformation law back 
into the Dirac equation, we find 

ir ^' {X>) = i7 " (A " 1) ^ M A W X )) ( 106 ) 



Thus, the Dirac equation now reads 

' dx 



i y(A- 1 );5(A)|^-mS (A)* = (107) 



Or, equivalcntly 



5- 1 (A) J7 ^(A- 1 )^5(A)^-m* = (108) 



This equation is covariant provided that 5 (A) satisfies 

5- 1 (A) 7 ^5 (A) (A- 1 );- 7" (109) 

Since the set of Lorentz transformations form a group, i.e., the product of two 
Lorentz transformations A 1 and A 2 is the new Lorentz transformation A*A 2 and 
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the inverse of the transformation A is the inverse matrix A -1 , the representation 
matrices 5(A) should also form a group and obey the same properties. In 
particular, 5 _1 (A) = 5 (A -1 ) must hold. Recall that the invariance of the 
relativistic interval x 2 = x^x^ implies that A must obey 

A*A* = = (110) 

Thus, A^ = (A- 1 )^. So we write, 

5(A) 7 ^5(A)- 1 = (A- 1 )^ 7 - (111) 

Eq. Ill shows that a Lorentz transformation induces a similarity transformation 
on the 7-matrices which is equivalent to (the inverse of) a Lorentz transforma- 
tion. For the case of Lorentz boosts, Eq. Ill shows that the matrices 5(A) are 
hermitean. However, for the subgroup 50(3) of rotations about a fixed origin, 
the matrices 5(A) are unitary. 

We will now find the form of 5 (A) for an infinitesimal Lorentz transforma- 
tion. Since the identity transformation is A£ = g£, a Lorentz transformation 
which is infinitcsimally close to the identity should have the form 

K + (A~X=^-^ (H2) 

where is infinitesimal and antisymmetric in its space-time indices 



(113) 



Let us parameterize 5 (A) in terms of a 4 x 4 matrix which is also antisym- 
metric in its indices, i.e., — —a^. Then, we can write 

5(A) = I- l - a ^ + ... 

5- x (A) = + 

(114) 

where I stands for the 4x4 identity matrix. If we substitute back, we get 

(7 - % -o^ + . . .) 7 A (7 + l -a a ^ + ...)= 7 A - + • • • (115) 

Collecting all the terms linear in u>, we find 

J[ 7 A , V ]^ = ^V (H6) 

Or, what is the same, 

[^,<r vX }=2i(g^ x -g^) (117) 
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This matrix equation is solved by 



ova = 7a] (US) 

Under a finite Lorentz transformation x' = Ax, the 4-spinors transform as 

*V) = S(A)* (H9) 

with 

5(A) =cxp[-^^H (120) 

The matrices a^ v are the generators of the group of Lorentz transformations 
in the spinor representation. While the space components ajk are hermitean 
matrices, the space-time components aoj are antihermitean. This feature is 
telling us that the Lorentz group is not a compact unitary group , since in that 
case all of its generators would be hermitean matrices, but that it is isomorphic 
to the non-compact group 50(3, 1). Thus, the representation matrices 5(A) are 
unitary only under space rotations with fixed origin. 

The linear operator 5 (A) gives the field in the transformed frame in terms 
of the coordinates of the transformed frame. However, we may also wish to ask 
for the transformation U (A) which just compensates the effect of the coordinate 
transformation. In other words we seek for a matrix U (A) such that 

tf'(a;) = E7(A)tf(a;) = 5(A)*(A" 1 x) (121) 

For an infinitesimal Lorentz transformation, we seek a matrix U (A) of the form 

U(A) = I- 1 -J^uj^ + ... (122) 
We wish to find an expression for J M „. We find 

(j-|j M „w'"' + ...)* = {I -- A a^ v + ...)* {x<>-ujIx v + ...) 

= (i- \o^ v + . . ^ (* - d p v uj p y + ...) 

(123) 

Hence 

*'(a;) = (j - \o^ v + x^ v d v + . . )j *(a;) (124) 
From this expression we see that J^ v is given by the operator 

J^v = + i{x^d u - x v dy) (125) 

We easily recognize the second term as the orbital angular momentum operator 
(we will come back to this issue shortly). The first term is then interpreted as 
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the spin. In fact, let us consider purely spacial rotations, whose infinitesimal 
generator are the space components of J^ u , i.e., 

Jjk=i{xjd k -x k d j ) + ^a j k (126) 
We can also define a three component vector Jg as the 3-dimensional dual of Jjk 

Jjk = tjkiJi (127) 
Thus, we get (after restoring the factors of H) 

Je = -^^tjk(xjdk — Xkdj) + -etjko-jk 

. hi 
= in eijk Xjdk + - {-^tijkGjk) 



Jt = (xApj + 



h 



(128) 



The first term is clearly the orbital angular momentum and the second term can 
be regarded as the spin. With this definition, it is straightforward to check that 

the spinors which are solutions of the Dirac equation, carry spin one-half. 

2.6 Transformation Properties of Field Bilinears in the 
Dirac Theory 

We will now consider the transformation properties of a number of physical 
observables of the Dirac theory under Lorentz transformations. Let 

= A£ x v (129) 

be a general Lorentz transformation, and 5(A) be the induced transformation 
for the Dirac spinors tp a {x) (with a = 1, ... ,4): 

i>' a (x>) = S(A) ab Mx) (130) 

Using the properties of the induced Lorentz transformation 5(A) and of the 
Dirac 7-matrices, is straightforward to verify that the following Dirac bilinears 
obey the following transformation laws: 

1. 

V>V ) VV ) = il>{x) tp(x) (131) 
which transforms as a scalar. 

2. 

V>V ) 75 ip'(x' ) = det Atp(x) 75 ip(x) (132) 
which transforms as a pseudo-scalar. 
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3. 

V>V ) 7 M ip'(x' ) = K ip(x) i v ip(x) (133) 
which transforms as a vector. 

4. 

$V ) 757 M V>V ) = det A A£ $(jc) 7 57 1/ ip(x) (134) 
which transforms as a pseudo-vector. 

5. 

V>V ) ct"" V>V ) = A£ k v (i il>(x) a af3 $(x) (135) 
which transforms as a tensor 

Above we have denoted by A^ a Lorentz transformation and det A is its deter- 
minant. We have also used that 

S*- 1 (A) 75 S*(A) = det A 75 (136) 

Thus, the Dirac algebra provides for a natural basis of the space of 4 x 4 matrices, 
which we will denote by 

r 3 ^/, rv = 7 „, r^ = 7 ^, r^ = 757M , r p = 75 (137) 

where S, V, T, A and P stand for scalar, vector, tensor, axial vector (or pseudo- 
vector) and parity respectively. For future reference we will note here the fol- 
lowing useful trace identities obeyed by products of Dirac 7-matrices 

1. 

trJ = 4, tr 7M = tr 75 = 0, trj^ u = (138) 
2. If we denote by a M and two arbitrary 4- vectors, then 

m= a^V -ia^ a^b h ', and tr <jt1j = 4a ■ b (139) 

2.7 Lagrangian for the Dirac Equation 

We now seek a Lagrangian density C for the Dirac theory. It should be a 
local differentiable functional of the spinor field *Sf. Since the Dirac equation is 
first order in derivatives and it is Lorentz covariant, the Lagrangian should be 
Lorentz invariant and first order in derivatives. A simple choice is 

C = &(i0 - m)* = - m^f (140) 

where ^>0^> = ty(0ty) — (d^)^^. This choice satisfies all the requirements. 
The equations of motion are derived in the usual manner, i.e., by demanding 
that the action S = J d 4 x C be stationary 
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The equations of motion are 



O a ; = 



5,,,—^- = 



(142) 



By direct substitution we find 



and 



(0 - m)tf = (143) 



V(i0 +m) = (144) 



which is the adjoint of the Dirac equation. Here, indicates that the derivatives 
are acting on the left. 

Finally, we can also write down the Hamiltonian density that follows from the 
Lagrangian of Eq. 140. As usual we need to determine the canonical momentum 
conjugate to the field "J/, i.e., 

n (*) = = = (145) 

Thus the Hamiltonian density is 

h = n(sc)tf (jc) - c = i*7°a * - c 

= • v* + m** 

= * f ^(3 • v + ™/?) * 

S v ' 

^Dirac 

(146) 

Thus we discover that the "one-particle" Dirac Hamiltonian i?Dirac of Eq. 72 
appears naturally in the field theory as well. Since this Hamiltonian is first order 
in derivatives (i.e., in the "momentum), unlike its Klein-Gordon relative, it is 
not manifestly positive. Thus there is a question of the stability of this theory. 
We will see below that the proper quantization of this theory as a quantum 
field theory of fermions solves this problem. In other words, it will be necessary 
to impose the Pauli Principle for this theory to describe a stable system with 
an energy spectrum that is bounded from below. In this way we will see that 
there is natural connection between the spin of the field and the statistics. This 
connection is known as the Spin-Statistics Theorem. 



2.8 Classical Electromagnetism as a field theory 

We now turn to the problem of the electromagnetic field generated by a set of 
sources. Let p(x) and j(x) represent the charge density and current at a point 
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x of space-time. Charge conservation requires that a continuity equation has to 
be obeyed. 

^+V-J=0 (147) 

Given an initial condition, i.e., the values of the electric field E(x) and the 
magnetic field B(x) at some t a in the past, the time evolution is governed by 
Maxwell's equations 



VB = 



(148) 







It is possible to recast these statements in a manner in which (a) the relativistic 
covariance is apparent and (b) the equations follow from a Least Action Princi- 
ple. A convenient way to see the above is to consider the electromagnetic field 
tensor F^ v which is the (contravariant) antisymmetric real tensor 



In other words 



F 0i 



( 





-E 1 


-E 2 


-E 3 \ 




E 1 





-B 3 


B 2 




E 2 


B 3 





-B 1 


V 


E 3 


-B 2 


B 1 


o / 



(149) 



_ _piO _ _pi 



where e*- 7 '* is the Levi-Civita tensor: 

1 if (ijk) is an even permutation of (123) 



ijk 



— 1 if (ijk) is an odd permutation of (123) 
otherwise 



The dual tensor F is defined by 

pnv _ 



pvn 



2 pa 

where e MI/pcr is the fourth rank Levi-Civita tensor. In particular 

B 1 E 3 -E 2 
B 2 -E 3 E 2 
B 3 E 2 -E 2 



pixv _ 



(150) 



(151) 



(152) 



(153) 



With these notations, we can rewrite Maxwell's equations in the manifestly 
covariant form 



j v (Equation of Motion) 

(Bianchi Identity) 

(Continuity Equation) 



(154) 
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At this point it is convenient to introduce the vector potential A 1 * whose con- 
travariant components are 

A"(x) = (^-,a) = {*,a) (155) 

where $ is the scalar potential, and the current 4- vector j fl (x) 

^(x) = (pc,j) = (f,j) (156) 
The electric field strength E and the magnetic field B we defined to be 

E = — V A -- — 
c c at 

B = v x A 

(157) 

In a more compact notation we write 

In terms of the vector potential A^, Maxwell's equations have the following 
additional symmetry 

A»(x) -» A»(x) + d»A(x) (159) 

(where A(x) is an arbitrary smooth function of space-time). It is easy to check 
that, under the transformation of equation (42), the field strength remains un- 
changed i.e., 

F» v -» F"" (160) 

This property is called Gauge Invariance and it plays a fundamental role in 
modern physics. By directly substituting the definitions of B and E in terms of 
A^ into Maxwell's equations, we get the wave equation. Indeed 

diiF i*» = y ^ d^A" - d"A») = f (161) 

which yields 

UA" -d»{d^)=f (162) 

This is the wave equation. We can further use the gauge-invariance to further 
restrict A* 1 (without these restrictions A^ is not completely determined) . These 
restrictions are known as the procedure of fixing a gauge. The choice 

dpA* = (163) 

known as the Lorentz gauge, yields the simpler wave equation 

U^=f (164) 
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Another "popular" choice is the radiation (or Coulomb) gauge 

V-1=0 (165) 

which yields (in units with c = 1) 

\JA» -d»(d A°)=f (166) 

In the absence of external sources (j v = 0) we can further make the choice 
A° = 0. This choice reduces the set of three equations (one for each spacial 
component of A) which satisfy 

UA = 
X7-A = 

(167) 

The solutions are plane waves of the form 

A{x) = Ae i{ P° x °- p -^ (168) 

which are only consistent if p\ — p 2 = and p- A = 0. This choice is also known 
as the transverse gauge. 

We can also regard the electromagnetic field as a dynamical system and 
construct a Lagrangian picture for it. Since Maxwell's equations are local and 
gauge invariant, we should demand that the Lagrangian density should have the 
same properties. A simple choice is 

C = —F^F^ - j^A* + gauge fixing terms (169) 

This Lagrangian density is manifestly Lorcntz invariant. Gauge invariance is 
satisfied if and only if is a conserved current {d^j^ — 0) since under a gauge 
transformation A^ — > A^ + d fl A(x) the field strength does not change but the 
source term does. 

J d 4 xj^ - J d 4 x [j^+j^A] 

= J d 4 x j^A* + J d 4 x 0/*(j„A) - J d 4 x 0/% 



A 

(170) 



If the sources vanish at infinity, (lim| x |_ >00 = 0), the surface term can be 
dropped. Thus the action S = J d 4 x£ is invariant if and only if 

V = ( m ) 

We can now derive the equations of motion by demanding that the action S be 
stationary, i.e., 



SS 



Jd*x[^5A» + J^ I Sd^}=0 (172) 
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Once again, we can integrate by parts to get 

SC / SC 



5S= f d 4 x d v — ; — SA^ + f d 4 x SA» 
J [5d»A» J J 



5A» V 6d v A» 



(173) 



If we demand SA^ | sur f ace = 0, we get 

— =d , 



i" = -a,^ (177) 



Explicitly, we get 
and 

Thus, we find 
or, equivalently 

j" = dpF*" (178) 
Therefore, the Least Action Principle implies Maxwell's equations. 

2.9 The Landau Theory of Phase Transitions as a Field 
Theory 

We now turn back to the problem of the statistical mechanics of a magnet which 
was introduced in the first lecture. In order to be a little more specific, we will 
consider the simplest model of a ferromagnct: the classical Ising model. In 
this model, one considers an array of atoms on some lattice (say cubic). Each 
atom is assumed to have a net spin magnetic moment S. From elementary 
quantum mechanics we know that the simplest interaction among the spins is 
the Hciscnbcrg exchange Hamiltonian 

H = -J2^S(i)-S(j) (179) 

<ij> 

where < ij > are nearest neighboring sites on the lattice. In many situations in 
which there is magnetic anisotropy, only the z-componcnt of the spin operators 
play a role. The Hamiltonian now reduces to the Ising case Hj 

H I = -JY J <r(i)<r(j) = E{a) (180) 

<ij> 

where a(i) is the z-projection of the spin at site i. 

The equilibrium properties of the system are determined by the partition 
function Z 

* = E«p(-^) ( 181 ) 
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where T is the temperature and {a} is the set of all spin configurations. 

In the 1950's, Landau developed a method (or rather a picture) to study 
these type of problems which in general, are very difficult. Landau first pro- 
posed to work not with the microscopic spins but with a set of coarse-grained 
configurations. One way to do this ( this approach is actually due to Kadanoff 
and Wilson) is to subdivide a large system of size L into regions or blocks of 
linear size I such that ao << £ << L. Each one of these regions will be cen- 
tered around a site, say x. We will denote such a region by A(x). The idea is 
now to perform the sum, i.e., the partition function Z, while keeping the total 
magnetization of each region M(x) fixed 

M ^ = ^M E °W (182) 

where N[A] is the number of sites in A(x). The restricted partition function is 
now a functional of M{x). 

Z[M] = ^exp{-^} Y[S \M(x) - — Yl a (y) J ( 183 ) 

The variables M(x) have the property that, for N(A) very large, they take 
values on the real numbers. Also, the configurations {M(x)} are much more 
smooth than the configurations {a}. 
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At very high temperatures the average magnetization < M > is zero since 
the system is paramagnetic. On the other hand, if the temperature is low, the 
average magnetization may be non-zero since the system may now be ferro- 
magnetic. Thus, at high temperatures the partition function Z is dominated 
by configurations which have < M >= while at very low temperatures, the 
most frequent configurations have < M >^ 0. Landau proceeded to write down 
an approximate form for the partition function in terms of sums over smooth, 
continuous, configurations M{x) which can be represented in the form 



J VM( 



r E(M(x),T), 
x) cxp[ V l h ' ] (184) 



where VM(x) is a measure which means "sum over configurations." If the rel- 
evant configurations are smooth and small, the energy functional E{M) can be 
written as an expansion in powers of M(x) and of its space derivatives. This is 
the Landau-Ginzburg (LG) form of the energy and it is given by 



F(M) = ^1 



d d x 



\k{T)\^M(x)\- + a -^M\x) + b -^M\x) + 



(185) 



Thermodynamic stability requires that the stiffness K(T) and the non-linearity 
coefficient b(T) be positive. The second term has a coefficient a(T) with can 
have either sign. A simple choice of parameters is 

K{T) w K 
b(T) w b 
a(T) w a(T-T c ) 

(186) 

where T c is an approximation to the critical temperature. 

The free energy F(M) defines a Classical (or Euclidean) Field Theory. In 
fact, by rescaling the field M(x) in the form 

$(z) = VKM(x) (187) 
we can write the free energy as 

F(<S>) = Jd d x |I(v<f>) 2 + C/($)| (188) 



where the potential U($) is 



tf($) = ^*2 + A$4 + _ (lg9) 
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where fh 2 = and A = ^ • Except for the absence of the term involving the 
canonical momentum II 2 (x), has a striking resemblance to the Hamiltonian 
of a scalar field in Minkowski space! We will see below that this is not an 
accident. 



U(D) 




Let us now ask the following question: is there a configuration Q c (x) which 
gives the dominant contribution to the partition function Z1 If so, we should 
be able to approximate 

Z = J V<5> exp{-F($)} w exp{-F($ c )}{l + • • • } (190) 

This statement is usually called the Mean Field Approximation. Since the inte- 
grand is an exponential, the dominant configuration $ c must be such that F has 
a (local) minimum at $ c . Thus, configurations <& c which leave F(&) stationary 
are good candidates (we actually need local minima!). The problem of finding 
extrema is simply the condition SF = 0. This is the same problem we solved 
for classical Field Theory in Minkowski space. Notice that in the derivation of 
F we have invoked essentially the same type of arguments: (a) invariance and 
(b) differentiability. The Euler-Lagrange equations can be derived by using the 
same arguments that we employed in the context of a scalar field theory. In the 
case at hand they are 

SF 5F „ , . 

-sW) +Vl s^W)-° (191) 
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For the case of the Landau theory we get 



= - V 2 *c(£) + m 2 $ c (x) + -<P 3 c (x) 



(192) 



The solutions & c (x) that minimize the energy are uniform in space and thus 
have Vi^c = 0. Hence, $ c is the solution of the very simple equation 



Since A is positive and m 2 may have either sign, depending on whether T > T c 
or T < T c , we have to explore both cases. 

For T > T c , fh 2 is also positive and the only real solution is $ c = 0. This is 
the paramagnetic state. But, for T <T cl m 2 is negative and two new solutions 
are available, namely 



These are the solutions with lowest energy and they are degenerate. They both 
represent the magnetized state. 

We now must ask if this procedure is correct, or rather when can we expect 
this approximation to work. It is correct T — and it will also turn out to be 
correct at very high temperatures. The answer to this question is the central 
problem of the theory of Critical Phenomena which describes the behavior of 
statistical systems in the vicinity of a continuous (or second order) phase tran- 
sitions. It turns out that this problem is also connected with a central problem 
of Quantum Field theory, namely when and how is it possible to remove the 
singular behavior of perturbation theory , and in the process remove all depen- 
dence on the short distance (or high energy) cutoff from physical observables. 
In QFT this procedure amounts to a definition of the continuum limit. The 
answer to these questions motivated the development of the Renormalization 
Group which solved both problems simultaneously. It will be a central theme 
in Physics 583. 

2.10 Analytic Continuation: Classical Field Theory and 
Classical Statistical Mechanics. 

We are now going to discuss a mathematical "trick" which will allow us to 
connect field theory with classical statistical mechanics. Let us go back to the 
action for a real scalar field $(x) in D = d + 1 space-time dimensions 



m 2 $ c + -$3 = 







(193) 




(194) 




(195) 



where d 



x is 



dPx = dx d d . 



x 



(196) 
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Let us formally carry out the analytic continuation of the time component xo 
of from real to imaginary time x d 



Xq — > -IX D 



(197) 



under which 



${x ,x) -> $(x,xd) = $(z) 



(198) 



where x = (f,iii). Under this transformation, the action (or rather i times the 
action) becomes 



iS = i j dx d d x £($, <9 *,V ,•*) -» y d D a;r($,-i02,$,Vj*) 



If £ has the form 



£ = ^*) 2 - = i(d <I>) 2 - i(v*) 2 - 



then the analytic continuation yields 



c($,-id D *, v *) - -i(az,$) 2 - I(y$) 2 - 



Then we can write 



X —> -IX D 



\(d D ®) 2 + \(^) 2 + V(<S>) 



(199) 



(200) 



(201) 



(202) 



This expression has the same form as (minus) the potential energy E((fr) for a 
classical field $ in D = d + 1 space dimensions. However it is also the same as 
the energy for a classical statistical mechanics problem in the same number of 
dimensions i.e., the Landau- Ginzburg free energy of the last section. 

In Classical Statistical Mechanics, the equilibrium properties of a system 
are determined by the partition function. For the case of the Landau theory of 
phase transitions the partition function is 



(203) 



where the symbol "J 2?$" means sum over all configurations (we will discuss 
the definition of the "measure" 2?$ later on) . If we choose for energy functional 
E(&) the expression 



where 



£($) = J d D x i(<9$) 2 + V($) 

(d$) 2 = (d D t>) 2 + (v*) 2 



(204) 
(205) 
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we see that the partition function Z is formally the analytic continuation of 

(206) 

where we have used h which has units of action (instead of the temperature) . 

What is the physical meaning of Z1 This expression suggests that Z should 
have the interpretation of a sum of all possible functions $(x, t) {i.e., the 
histories of the configurations of the field $) weighed by the phase factor 
exp{|5(<I>, 9^$)}. We will discover later on that if T is identified with the 
Planck constant h ( which is dimensionally consistent), then Z represents the 
path-integral quantization of the field theory! Notice that the semiclassical limit 
h —* is formally equivalent to the low temperature limit of the statistical me- 
chanical system. 

The analytic continuation procedure that we just discussed is known un- 
der the name of a Wick rotation. It amounts to a passage from D = d + 1- 
dimensional Minkowski space to a Z?-dimensional Euclidean space. We will find 
that this analytic continuation is a very powerful tool. As we will see below, a 
number of difficulties will arise when the theory is defined directly in Minkowski 
space. Primarily, the problem is the presence of ill-defined integrals which are 
given precise meaning by a deformation of the integration contours from the real 
time ( or frequency) axis to the imaginary time (or frequency) axis. The defor- 
mation of the contour amounts to a definition of the theory in Euclidean rather 
than in Minkowski space. It is an underlying assumption that the analytic con- 
tinuation can be carried out without difficulty. Namely, the assumption is that 
the result of this procedure is unique and that, whatever singularities may be 
present in the complex plane, they do not affect the result. It is important to 
stress that the success of this procedure is not guaranteed. However, in almost 
all the theories that we know of, this assumption seems to hold. The only case 
in which problems are known to exist is the theory of Quantum Gravity (which 
we will not discuss here). 
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